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Abstract 

We investigate the achievable error probability in communication over an AWGN discrete time memoryless 
channel with noiseless delay-less rate-limited feedback. For the case where the feedback rate Rp^ is lower than 
the data rate R transmitted over the forward channel, we show that the decay of the probability of error is at 
(^ , most exponential in blocklength, and obtain an upper bound for increase in the error exponent due to feedback. 

CN ■ Furthermore, we show that the use of feedback in this case results in an error exponent that is at least Rpg higher 

$— ( . than the error exponent in the absence of feedback. For the case where the feedback rate exceeds the forward 

r^ rate (RpB > R), we propose a simple iterative scheme that achieves a probability of error that decays doubly 

exponentially with the codeword blocklength n. More generally, for some positive integer L, we show that a L*'' 
f^ ■ order exponential error decay is achievable if Rp^ > {L — 1)R. We prove that the above results hold whether 

CSJ . the feedback constraint is expressed in terms of the average feedback rate or per channel use feedback rate. Our 

■ results show that the error exponent as a function of Rp^ has a strong discontinuity at R, where it jumps from a 

r I , finite value to infinity. 
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While feedback cannot increase the capacity of a point-to-point memoryless channel, it can decrease 

T 1 

<^ ' the probability of error as well as the complexity of the encoder and decoder. For an AWGN channel 

^ without feedback, it is known flTJI that the decay in the probability of error as a function of the blocklength 

n is at most exponential in the absence of feedback (i.e. the lowest achievable probability of error has 

the general form P^ = exp(— 0(n)))ll| However, when a noiseless delay-less infinite capacity feedback 

(^ ■ link is available, a simple sequential linear scheme (the Schalkwijk-Kailath scheme L2J) can achieve the 

O ' capacity of this channel with a doubly exponential decay in the probability of error as a function of 

^ ; the blocklength (i.e. it has the general form P^ = exp(— exp(i7(r2)))). This shows the significant role of 

;L| ' feedback in reducing the probability of error. 

• ^ The Schalkwijk-Kailath scheme requires a noiseless feedback link with infinite capacity. In fact, the 

rS ! Schalkwijk-Kailath scheme does not provide the best possible error decay rate given such an ideal 

c3 ; feedback link. In particular, it is shown in ^ that in the presence of an ideal noise-free delay-less 

feedback link, the capacity of the AWGN channel can be achieved with a probability of error that 

decreases with an exponential order which is linearly increasing with blocklength (i.e. it has the general 

form Pg = exp(— exp o ... o exp(r2(n))))a However, once the feedback channel is corrupted with some 

Q(n) times 

noise, the benefits of feedback in terms of the error probability decay rate can drop. In fact, when this 
corruption corresponds to an additive white Gaussian noise on the feedback channel, the Schalkwijk- 
Kailath communication scheme (or any other linear scheme) fails to achieve any nonzero rate with 
vanishing error probability [4J. Furthermore, in this case, the achievable error decay for any coding 
scheme can be no better than exponential in blocklength [5J, similar to the case without feedback [[T]|. 

R. Mirghaderi, A. Goldsmith and T. Weissman are with the Department of Electrical Engineering, Stanford University, 350 Serra Mall, Stan- 
ford, CA, USA, email: rezam@stanford.edu, andrea@wsl.stanford.edu, tsachy@stanford.edu. This 
paper was presented in part at the 48' ^ Annual AUerton Conference on Communications, Control and Computing, 2010. This work is 
supported by the NSF Center for the Science of Information under Award CCF-0939370. 

'Given a function h{.), h{n) — 0{n) is equivalent to liin,i^oc-^ < cx), and h{n) = Q,{n) is equivalent to Iini „ ,^„— ^ > 0. 

^Operator o is used to denote function composition. 



In this work, we consider a case where the feedback link is noiseless and delay-less but rate-limited. 
The advantages of rate-limited feedback in reducing the coding complexity are investigated in (611. In this 
paper, we study the benefits of rate limited feedback in terms of decreasing the error probability. Assuming 
a positive and feasible (below capacity) rate R is to be transmitted on the forward channel, we characterize 
the achievable error decay rates in two cases: the case where the feedback rate, Rfb, is lower than R, 
and the case where RpB > R- For the first scenario, we show that the best achievable error probability 
decreases exponentially in the code blocklength n (i.e. Pg = exp(— 0(n))) and provide an upper bound 
for the error exponent. For the second scenario, we propose an iterative coding scheme which achieves a 
doubly exponential error decay (i.e. Pe = exp(— exp(i7(ri)))). Since a feedback rate equal to the data rate 
is sufficient for achieving a doubly exponential error decay, one might suspect that further increasing the 
feedback rate may not lead to a significant gain. We dispel this suspicion by generalizing our proposed 
iterative scheme to show that if R^b > (L — 1)R, an L*'^ order exponential decay is achievable. The latter 
result is consistent with [7J, in which the achievable error probabilities are characterized in terms of the 
number of times the (infinite capacity) feedback link is used. 

Interestingly, our results show that the error exponent as a function of the feedback rate has a strong 
discontinuity at the point Rpg = R; it is finite for Rpg < R and infinite for Rpg > R (due to the 
achievability of a doubly exponential error decay). 

Although only Rp^ > R can lead to a super-exponential error decay, even for smaller feedback rates, 
we expect to have a strictly higher error decay rate as compared to the case with no feedback. In particular 
we show that for Rpg < R, the error exponent is at least Rpg higher than the error exponent in the absence 
of feedback. 

The problem of communication over the AWGN channel with limited feedback has been previously 
considered assuming different types of corruption on the feedback channel. In particular, the corruption 
on the feedback channel has been modeled as additive Gaussian noise in [41 and [SJ and as quantization 
noise in [8]. Another type of feedback corruption has been considered in fOl where only a subsequence 
of the channel outputs can be sent back noiselessly to the transmitter. A fundamental distinction between 
our model and the ones considered above is that in our model the receiver has "full control" over what is 
transmitted and received on the feedback link. This is due to the fact that under the rate-limited feedback 
scenario, the feedback link is assumed to be both noiseless and active in the sense that at each time, the 
feedback message is allowed to be an encoded function of all the information available at the receiver at 
that time. Communication with imperfect feedback has also been investigated in [fTOll . [ITTIl and [11211 for 
variable-length coding strategies. Our model on the other hand captures a scenario where the blocklength 
and therefore the decoding delay is fixed. 

The rest of this paper is organized as follows: In Section II we present the system model and the problem 
formulation. In Section III we consider the case where the feedback rate is higher than the forward rate. 
Specifically, using a simple iterative coding scheme we show the achievability of an L^'^ order exponential 
error decay when Rpg > (L — 1)R. In Section IV we consider the case where Rpg < R and show that 
in this case the decay in probability of error is at most exponential (finite first order error exponent). 
Although a feedback rate less than R cannot provide super-exponential error decay, we will show in 
Section V that it increases the error exponent by at least Rpg- Section VI shows that the necessary and 
sufficient conditions for super-exponential error decay remain the same even if we express the feedback 
limitation as a constraint on the per channel use feedback rate instead of the average feedback rate. Finally, 
Section VII concludes the paper. 

Notation. Throughout this paper we represent the L2 norm operator by 1 1 . 1 1 and the expectation operator by 
E[.]. The notation "log" is used for the natural logarithm, and rates are expressed in nats. The complement 
of a set A is denoted by A'^. We denote the indicator function of the event ^ by 1^. Given a function 
h{.), h{n) = 0(1) is equivalent to lim^^oo \h{n)\ = 0. Given a function h{.) and a positive integer k, the 
k^'^ iterate of the function, i.e. h o . . . o h (.), is denoted by h''{.). 
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Fig. 1. AWGN channel with rate-limited feedback 



II. System Model 

We consider communication over a block of length n through an AWGN channel with rate-limited 
noiseless feedback. The channel output Yi at time i is given by 

Y, = X, + iV„ 

where {Ni}f^-^ is a white Gaussian noise process with Ni ~ A/'(0, 1) and Xj is the channel input at time 
i. The finite- alphabet feedback signal at time i is denoted by Ui E Ui and is assumed to be decoded at 
the transmitter (of the forward channel) without any error or delay. We will denote the feedback sequence 
alphabet Wi x ... x W„ by W. The message m to be transmitted (on the forward link) is assumed to be 
drawn uniformly from the set A^ = {1, ..., |A^|}. 

An encoding strategy is comprised of a sequence of functions {/j }"=i where fl : Ai x Ui x ... x 
Wj_i I— 7> M determines the input Xj as a function of the message and the feedback signals received before 
time i, 

X, = /f)(m,[/i,...,[/,„i). 

The feedback strategy consists of a sequence of functions {fifj }"=i where g-'^' -.Wh^Ui determines the 
feedback signal as a function of the channel outputs up to time i, 

U, = gt\Y,,...,Y^). 

The decoding function : M* i— t- A^ gives the reconstruction of the message after receiving all the channel 
outputs 

m = 0(")(F"). 

The probability of error for message m is denoted by Pe{m), where 

Pe{iTL) = Pr{m ^ m\m is transmitted}. 
The average probability of error is defined as 

^ \M\ 



m=l 



Given the above setup, a communication scheme with forward rate R, feedback rate Rp^ and power 
level P is comprised of a selection for the feedback sequence alphabet U, the encoding strategy {fp}f^i, 
the feedback strategy {g^}^=i and the decoding function 0(")(.), such that 
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where the expectation is with respect to the messages and the noise. Over all such communication schemes, 
we represent the one with minimum average probability of error with the tuple (n, R, Rfb, P) and denote 
the corresponding minimum error probability by Pe{n, R, Rfb, P)- In the case where the feedback rate is 
zero, we simply drop the feedback rate term and use (n, R, P) and Pe{n, R, P) to represent the optimal 
non-feedback code and the corresponding error probability, respectively. The capacity of the AWGN 
channel is denoted by C, where 

C = ^\og{l + P). 

For the communication system described above, the first order error exponent or simply the error 
exponent is defined as 

n 
where a positive value of the error exponent implies that the error decay rate is at least exponential. We 
also define higher order error exponents. In particular, given L > 2, the L*^ order error exponent is defined 
as 

El [R, Rfb ,P) = lim„_,oo • (2) 

n 

Given the above definitions, a communication system with strictly positive L*'* order error exponent has 
an L*^ order exponential error decay (i.e. Pe{n, R, Rfb, P) = exp(— exp'^~^(r2(n)))). 

III. Rfb > R- Super-Exponential Error Decay 

When the feedback rate is higher than the forward rate R, we can achieve a super-exponential (in 
blocklength) error decay. This result is presented in the following theorem. 

Theorem 1 For any R > which satisfies R < Rfb and R < C, a strictly positive second order error 
exponent is achievable: 

E2{R,Rfb,P)>0. 

Proof: See Appendix. ■ 

The above result can be further generalized as follows. 

Theorem 2 Given an integer L > 2, for any R > which satisfies R < j^RpB <^nd R < C, a strictly 
positive L*^ order error exponent is achievable: 

El{R,Rfb,P)>0. 

Proof: See Appendix. ■ 

We use a class of simple iterative coding schemes to prove the above achievability results. In particular, 
to achieve a doubly exponential error decay we propose a multi-phase coding scheme as follows: in the 
first phase, called the initial transmission, the message is sent using a non-feedback code that occupies 
a big portion of the transmission block (ui out of n). In the second phase, called the intermediate 
decoding/feedback phase, the receiver decodes the message based on the received signals and feeds back 
the decoded message to the transmitter, using nR nats of the available feedback. Depending on the validity 
of the decoded message the transmitter decides to stay silent or perform boosted retransmission. In the 
case the message is decoded correctly, the transmitter stays silent during the rest of the transmission time. 
Otherwise, it sends a sign of failure in the next (ni + 1"^*) transmission and uses the remaining portion 
of the transmission block (n2 = n — ni — 1) to send the message with an exponentially (in block length) 



high power. While retransmission with such a large power guarantees a doubly exponential error decay, 
it does not violate the power constraint since the probability of incorrect decoding in the second phase is 
exponentially (in block length) low. 

To guarantee an L— fold exponential decay when the available feedback rate is {L—1)R, for some integer 
L > 2, the above scheme can be modified to include L — 1 rounds of intermediate decoding/feedback and 
boosted retransmission, where retransmission at each round, if needed, is done with exponentially higher 
power than the previous retransmission. 

Note that in comparison with the Schalkwijk-Kailath (SK) scheme presented in [[21, the above iterative 
technique needs less feedback (LR nats instead of the infinite rate required by the SK scheme) and 
provides better error decay rate. 

IV. RpB < R: First Order Exponential Error Decay 

In the previous section we have shown that by utilizing a feedback link with a rate higher than the 
forward rate, we can reduce the error probability significantly as compared to the case with no feedback. 
The high reliability of the iterative scheme presented in the last section is due to the fact that the initial 
decoding error at the receiver (which is a rare event) is perfectly detectable at the transmitter. Therefore 
it can be corrected by retransmitting the message with high power without violating the average power 
constraint. The perfect error detection at the transmitter is obtained from the feedback of the initial decoded 
message at the receiver. However, when the feedback rate is lower than the forward rate, the receiver has 
to use a source code to compress its decoded message before feeding it back. The transmitter must then 
reconstruct the uncompressed decoded message to detect any error. Since this reconstruction involves some 
first order exponential (in blocklength) error decay (corresponding to the source coding error exponent), 
the error detection is erroneous with the same decay rate. Therefore, the mis-detection of the receiver 
error due to the compression on the feedback link dominates the error probability. 

While the above intuitive explanation justifies the failure of the block retransmission schemes in 
achieving a super-exponential error decay, one might still hope that such a decay rate can be achieved 
using other schemes. For example one alternative is to look at the problem from a stochastic control point 
of view and use a rate-limited variant of the recursive feedback schemes presented in [13 j and [fT4l . In 
this section, we show that no matter what communication scheme is used, one cannot achieve infinite first 
order error exponent. 

Theorem 3 Given R > Rps, the first order error exponent is upper bounded by 

Ei{R,RpB,P) < Eup{Rfb), 

where Eup{Rfb) = 4P + Tq/2 + Rpg and Tq is the solution to |(ro — 1 — log(ro)) = Rfb- 

Proof: See Appendix. ■ 

The proof, which is rather lengthy, can be explained using the following observation. It is shown in 
ifTSl that given a peak power constraint, the best achievable error decay is exponential. Therefore, in 
order to achieve a super-exponential error decay, the transmitter should be able to boost the power under 
certain circumstances. However, given the expected power constraint, the power can be boosted only under 
rare occasions where the receiver would decode wrongly otherwise. Therefore, there should be enough 
feedback bits to communicate the occurrence of those rare occasions to the sender. It turns out that this 
requirement is met only if the number of possible feedback messages (e"^^^) is at least as large as the 
number of forward messages (e"^). 

Note that the error exponent upper bound provided in the above theorem stays bounded as Rpg 
approaches R from below. On the other hand, we showed in the previous section that for any feedback 
rate higher than R, the error exponent is infinite (doubly exponential decay). These two facts lead to an 
interesting conclusion: the error exponent as a function of the feedback rate has a sharp discontinuity at 
the point Rpg = R. 



The above theorem provides an upper bound on the first order error exponent for feedback rates below 
R. We conjecture that a similar result may be obtained on the boundedness of the L^^ order error exponent 
for feedback rates below LR. 

V. RpB < R: Lower bound on error exponent 

We have shown in the previous section that the probability of error when Rpg < R cannot decay faster 
than exponential as a function of the blocklength n. Although the feedback in this case does not provide 
an infinite error exponent, we still expect that the error exponent should be improved in the presence of 
feedback as compared to the non-feedback scenario. In this section we will show that the error exponent 
with feedback is at least Rp^ above the non-feedback error exponent. The main result of this section is 
the following theorem. 

Theorem 4 For all rates R < C, such that R > Rpg, the error exponent is lower bounded as follows 

Ei{R, RpB, P) > Ej^^pb{R) + RpB, (3) 

where Ef^^pglR) is the error exponent for the AWGN channel in the absence of feedback. 

Proof: See Appendix. ■ 

The achievability scheme for the above result is constructed using the multi-phase scheme proposed 
in the proof of Theorem [H in conjunction with a compression technique to reduce the rate of feedback 
in the intermediate decoding/feedback phase from R to Rpb- Using such a scheme, the error probability 
is dominated by the probability of error mis-detection. This error term is the product of the probability 
of error in the initial transmission phase (exp(—nEj^„pB{R))) and the probability (exp(— ni?^^)) that the 
compression loss hides this event from the transmitter. 

VL Per channel use feedback constraint 

In the previous sections we focused on a scenario where the average rate over the whole transmission 
block was constrained to be lower than RpB- Under that constraint, the receiver can use the available 
feedback (nRpg nats) any time during the transmission. In particular, using the coding scheme proposed 
in Section III, the receiver collects all the feedback bits and uses them in one feedback transmission at 
the end of the first phase. In this section we consider a per channel use feedback rate constraint. Under 
this constraint, the receiver cannot feed back more than RpB nats after each channel use. This translates 
to the following constraint on the size of the feedback signal alphabet at each time i E {1, ...,n}: 

H\ < e^^^. (4) 

Given that the above constraint is more restrictive than the average feedback rate constraint considered 
previously, we can conclude that the upper bound on the error exponent obtained in Section IV holds 
in the above scenario as well. Interestingly, we show that similar achievability results as those stated in 
Section III for the average feedback rate constraint are also true for the per channel use feedback scenario. 

Theorem 5 Given the per channel use feedback constraint, if Rpb > {L — 1)R and R < C, a strictly 
positive L''^ order error exponent is achievable: 

El{R,Rpb,P)>0. 

Proof: See Appendix. ■ 

The above result is proved using a combination of the scheme presented in Section III and a block Markov 
coding scheme which is described in the Appendix. Figure [2] illustrates an example of this iterative coding 
scheme for the case where L = 2. 
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Fig. 2. Iterative feedback scheme for per channel use feedback constraint: An example 



VII. Summary and Discussion 

We considered the impact of rate-limited noiseless feedback on the error probability in AWGN channels. 
We first showed that if the feedback rate Rpg that exceeds the rate R of the data transmitted on the forward 
channel, one can achieve a super-exponential decay in probability of error as a function of the code 
blocklength. Our achievability result is based on a multi-phase scheme in which an initial transmission 
of the message, if decoded incorrectly, is followed by the retransmission of the message with boosted 
power. A key requirement in this scheme is for the transmitter to perfectly detect the error in the initial 
transmission every time it happens. The minimum feedback rate required to perfectly communicate the 
initial decoded message is R and therefore our scheme fails to achieve a super-exponential error decay 
for RpB < R. We showed that this is true for any scheme. That is, Rpg > R is also a necessary condition 
for achieving a super-exponential error decay. While we provided an upper bound for the error exponent 
when RpB < R, we also showed that even in this case, the use of feedback increases the error exponent 
by at least Rpg. For the case in which Rpg > (L — l)R, for some positive integer L, we generalized our 
multi-phase iterative scheme to prove the achievability of an L— fold exponential (in blocklength) error 
decay. The above results are illustrated in Figure [3l It can be seen that the error exponent as a function 
of the feedback rate has a sharp discontinuity at Rpg = R. 

We showed that the above necessary and sufficient condition for achieving a super-exponential error 
decay holds whether the feedback limitation is expressed as a constraint on the average feedback rate 
or on the per channel use feedback rate. Note that our results address the asymptotic behavior of the 
probability of error in terms of the blocklength n and therefore may provide limited insight for codes 
with small blocklength. In particular, for small values of n, one might expect the per channel feedback 
rate constraint to lead to a higher error probability than a scenario with average feedback rate constraint. 
On the other hand, the former is a more practical scenario as it implicitly captures the delay associated 
with sending data on the feedback link. 

In this paper we showed the advantages of feedback in terms of improving the decay rate of the error 
probability. A subject for future research is to explore the other advantages of interactive communication 
in terms of reducing the coding complexity and energy consumption. One interesting problem to be 
addressed is how to use rate-limited feedback to construct SK like schemes which do not need complex 
block encoding decoding. 



VIII. Appendix 

Proof of Theorem \J^ Fix 5 > such that R < C{1 
where e > is chosen such that 
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5). Define n2 = en and ni 



n - 77,2 - 1, 

(5) 
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Fig. 3. An illustration of the bounds on error exponents in terms of the feedback rate Rfb 



holds for large enough n. Choose the feedback signal domains as follows 

Ui = {1}, for i ^ rii 

We construct two non-feedback codes ^i = (ni, — , P) and ^2 = (^2, — , P/l), where 



(6) 



For m E {!,..., e"^}, pick the corresponding codeword X"^(m) from "^i and send it in the first rii channel 
uses. Based on the received signals F"^ and using the optimal non-feedback decoding function for code 
^1, the transmitter decodes the message and sends back its decision rhi to the transmitter 



If rhi = m, then 

otherwise, the next input will be 



Un, = rhi. 
Xi = 0,i = rii + 1, ..., n, 



Xn,+1 = /P/T 



and then the codeword corresponding to m is picked from the codebook ^2 and is transmitted in the 



remaining ^2 transmissions. On the other side, the receiver compares F„^+i with the threshold T 



If 1^1+1 < r, then the remaining received signals are ignored and the decoded message in the first try is 
announced as the final decision 

m = rhi. 

If 1^1+1 > r, the receiver decodes the message based on the last n2 received signals and using the optimal 
non-feedback decoding function for code ^2- The resulting message rh2 is then announced as the final 
decision 

m = 1712. 

Using the above scheme, the average power used in the forward link will be 

^{n^P + ^{n2){Ph))<P. 
n 

Therefore our scheme satisfies the power constraint. Also the average feedback rate is R which meets 
the constraint on the feedback link. There are three cases in which an error can happen. The first case is 
when the first decoding is correct but the receiver receives a failure signal from the transmitter due to the 
noise on the rii + I'** transmission. The probability of this event is upper bounded by 

Pe{false negative} < g(r), (7) 

where Q{.) is the tail probability of the standard normal distribution. The second case is when the first 
decoding is wrong but the failure signal is not decoded correctly at the receiver. The probability of this 
event is upper bounded by 

Pe{false positive} < g(r). (8) 

The third case is when the first decoding fails and the failure signal is decoded correctly, but the second 
decoding also fails. The probability of this event satisfies 

nR 
Pe{wrong decoding} < Pe{n2, — .P/l) (9) 

= PeK,J,P/7)- (10) 

Using the exponential upper bound for the Q— function, we have 

P 
Peifalse negative} + Peifalse positive} < aexp( ), (11) 

87 

where a > is some constant. By positivity of the error exponent for rates less than the capacity |[T1 and 
since — < C(l — 5^), we know that for any 5 > 0, there exists a fixed C > such that 

7 = Pe(m,^,p') <e-<. (12) 

for large enough values of n. Combining ([TT]) and (fT2l) . we obtain 

Pe {false negative} + Pe {false positive} < exp(-e"(^+°(^^)), (13) 

which shows the probability of the first two types of errors decays doubly exponentially in the blocklength. 
It remains to show that the third type of error is also upper bounded by a doubly exponential term. To 
show that, note that on the right hand side of (flOl) . the rate is at most 1/e times the capacity achieved by 
SNR P. However, the SNR P/7 is exponentially (in n) higher than P 

P/7 > Pe"^, 



10 

for large values of n and therefore 

Pe{wrong decoding} < Pe{en, — , Pe"^). (14) 

Given (fT3l) and the above inequality, the proof will be complete if we show that Pe{en, — , Pe"^) decays 
doubly exponentially as a function of n. To show this, we can use the fact that for communication rates 
(in nats/channel use) less than 

1 2 + v/P^T4 
- m , 

2 4 

the following upper bound on error probability holds in the absence of feedback [[Tl: 

Pe(n,P,P)<e-"(^(^'^)-^'), 
for any e' > and for large enough values of n, where 

E{R,P) = ^(l-v/(l-e-2^)). (15) 

Take n sufficiently large such that 



I.e. 



R 1, 2 + VP2e2< + 4 
— < -In , 

e 2 4 ' 



1 (4e2f-2)2-4 
n > - In .^ ^ 



C P' 



Then using (1151) leads to 



P,{en,R/e,Pe<) < e~^^<^i^-V^^^^)+^') 
= exp(-exp(n(C + o(l)))) 



Proof of Theorem |2l- Let's partition the whole transmission block into L + 1 sub-blocks, the first 
of which has length (1 — e)n. We choose the remaining sub-blocks to have equal lengths. In the first 
sub-block, the transmitter sends the message using the non-feedback Gaussian codebook ^i with rate R 
and power P. After transmission in the z*'^ sub-block, the receiver feeds back the message it has decoded 
within that sub-block. If the decoded message matches the transmitted one, the transmitter stays silent 
for the rest of the time. Otherwise, it sends a failure alarm and retransmits the message in the i + P* 
sub-block using a non-feedback Gaussian codebook % with rate R. The power of the alarm signal and 
the power Pj of codebook % are chosen to be inversely proportional to the probability of decoding error 
in the first i sub-blocks. That is, 

P,+i = P/7„ 

where 7, is the total probability of error in the first i sub-blocks. The L-fold exponential error decay can 
be shown inductively. Given that the probability 7j is (i — l)-fold exponential in terms of the blocklength 
(the case of i = 2 was shown in the previous Theorem), the power at the i^'^ sub-block (if transmission is 
needed) is {i — l)-fold exponential in blocklength. This in turn leads to an z-fold exponential error decay 
at the end of the i^'^ sub-block. Note that both the transmission power and the feedback rate in the above 
scheme satisfy the problem constraints. ■ 
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Proof of Theorem |21' Let us first introduce some key definitions whicli will be used in our proof. 
We define the decoding region for message m as 

D{m) = {r" :(/)(") (r") =m] 

Also for each feedback signal sequence m" = (mi, ...,«„) £ ^^ let's define the feedback decision region 

A key quantity in our proof is the joint distribution of the feedback signal sequence and the output 
sequence given the transmitted message Pyn^n|j;/(., .|.). For simplicity, we drop the subscript and use 
P{y^\ u"|m) to denote the density of the output sequence y" and the feedback sequence m" = (ui, ..., Un) 
conditional on the transmission of the message m. Defining uq = 0, we can write 

P(l/",w"|m) = IVl=,P{y,\m,u'-\y'-')P{u,\m,u'-\y') (16) 

= U^^,p(^y.\m,u'-\ft\m,u'-'),y^-')p(u,\m,u^-\y\gl''\y^)) (17) 

= n^=,p{y.\ft\my))\^,=,i«^^y.,y (18) 

- l{y"gB(»")}ll»=i ^2n) \ 2 j 

= l{yn^B{u")}{^T^) ' exp I I, (2U) 

where f^'^^{m,u'^) = (/{" (m, uq), ..., /n (m, u"~^)). In this derivation, (fT6l) is a consequence of the 
probability chain rule. Equation (IT71) is derived using the fact that for any two random variables {W, S) 
and any deterministic mapping T(.), W -^ S i-^ T(S) is a Markov chain. Finally, (fTSl) is a direct result of 
the Markov chain relationship (M, f/*~^, y*~^) ^ Xj ^ Fj and also the equation f/j = (7|"^(F*). Another 
quantity of interest will be the probability of using a feedback signal sequence n" G W conditional on the 
transmission of a message m E A4, 

P{u^\m)= I P{y'',u''\m)dy''. (21) 

With the above definitions we can now proceed with the proof. Suppose the theorem does not hold. 
That is, let's assume there exists 7 > such that the following inequality can hold for arbitrarily large n: 

Pe{n, R, Rps, P) < e-"(-^"^(-f^^s)+^). (22) 

Given such n's, the above inequality implies that for at least half of the messages m G A^, we have 

P (m) < 2e~'^^^''^^^^^^~^'^^ = e~"(-^"p('^^s)+'''+°(^)). (23) 

Removing the messages which do not satisfy the above, we obtain a codebook with the rate of at least 
^ log(^) which, for arbitrarily large n, is arbitrarily close to R. Therefore, (l22l) implies the existence of 
a code with rate R for which the per message error probability can be less than its right hand side for 
arbitrarily large n and for some 7 > 0. Let us define s{n) = n{Eup{RFB) +7)- To prove the theorem, we 
will show that there exists no such that for any n > uq, the inequality 

Pe{m) < e"'(") (24) 

cannot hold for all messages m E Ai. Let us fix uq, to be determined later, and assume that for some 
n > Uq, there exists a communication scheme for which (^^ holds for all m. Given such a communication 
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scheme, for each m, we construct an initial bin Fo(m) including a subset of feedback signal sequences 
as follows 

Fo{m) = {m" : P(M"im) > ^e""^^^}, 

where 5 > is a fixed constant, to be determined later. Defining Pr{Fo(m)|m} as J2u"eFo(m) P{u"'\m), 
we can write 

Pr{Fo(m)|m} = 1- ^ P(M"|m) 

> 1 - (^iWle"'^^^^ 

> 1-6 (25) 

In the following algorithm we update the content of each bin sequentially. 

1) Start with i = 0. 

2) Pick two distinct messages m,m' E A^, such that there exists a feedback sequence m" where both 
Fiijn) and Fi{m') include -u". 

3) Assuming ||/*^")(m, m")!!^ > ||/'^")(m',M"')|p (without loss of generality), remove -u" from Fi(m). 

4) Increase i by 1 and set Fi{k) = Fi_i{k), for all k € A^. 

5) S&i J = {keM: Fi{k) ^ 0}. If | J| > e"^^^, go to step 2, otherwise stop. 

Note that step 2 is feasible since whenever this step is executed the number of non-empty bins are greater 
than the cardinality of \U\ which is e"^^^. Therefore, there should exist at least one feedback sequence 
which appears in two bins. Also note that for any A; G A^ and any integer % 

Fi{k)CF,_,{k)...CFo{k). (26) 

Assume 171,711' are the messages picked in step 2 and u" is the sequence removed from the bin Fi{m) 
in step 3 and at iteration i of the above algorithm. Given such a 3-tuple (u'^, 711,711'), a major part of the 
rest of the proof is devoted to obtaining a lower bound for ||/(")(m,M")|p. First for any y", let's use the 
triangle inequality to write 

||y"-/(")(m,«")||2 < (||^"-/H(rn>")|| + ||/(")(m,«")-/(")(m',n")||)2 

= lb" - /^"n^',M")|p + ||/(")(m,u") - /(")K,w")lP 

+2||i/"-/(")(m',«")||.||/(")(m,«")-/(")(m',«")|| 
< 2(||y"-/(")(m',«")||2 + ||/W(m,«")-/(")(m',w'^)|p). (27) 

Similarly, we have 

||/(")(m,w")-/(")K,«-)||2<2(||/(")(m,n")|p + ||/(")(m',«")|p). 

Combining ^7}, dSSJ) and the assumption in step 3 of our algorithm that ||/*^"^(m,M")|p > ||/(")(m',M")|p, 
we have 

||y"-/('^)(m,n")|p<2(||y'^-/(")(m',n'^)||2 + 4||/(")(m,u")|p). 

Using this inequality and the derivation in (l20l) . we have 

P(l/",«"|m) > l|,.,5(„.)}exp (-4||/(")(m,«")|H (27r)-texp (-||y" - /(")(m', n")in . (28) 
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Denoting the complement of a set A by A^, we can write 






Pe{m)= / I >^ P{y\u'^\m) \dy^ (29) 

<D{mY 



> / P{y'',u''\m)dy'' (30) 



> / P(2/",n"|m)dy'^ (31) 

-'D(m')nB(M") 

> exp (-4||/(")(m,«")||2)/^^^,^^^^^„^(27r)-texp (-||3/"-/("'(m',«")||2) dy", (32) 

where (|3T1) is due to the fact that D{m) and D{m') are disjoint sets and the last inequality is a consequence 
of (|28]) . Using the assumption f l24|) and rearranging the above inequality, we can write 



\\f-\m,u-)f > i (.(n) + log^(^,)^^(„„)(27r)-texp(-||,"-/(")K,.'^)||2)rfy-) . 



(33) 



To complete our lower bound for ||/*^")(m,M")|p, in the following, we find a lower bound for the integral 
in (|33]) . First note that since m" e Fi{m), we can write 

P(y",M"|m')rfl/" 

D(m')nB(u") 



= P{u''\m')- P{y'',u''\m')dy'' 

J D(m'Yr\B(u") 

>P{u''\m)-Pe{m') 

> (5e~"^^^ - e-'(") (34) 

~ 6 

> -nflfS^ (35) 

where (|M)) follows from the assumption that (|2^ holds for all the messages and the fact that u"- picked in 
step 3 and at the i^^ iteration of the algorithm is in bin Fi{m') and therefore is a member of FQ{m'). Also 
inequality (l35l) is secured by the appropriate choice of uq. Now let's define the sphere Sp{f^"\m',u"')) 
as 

5p(m',w") = {y" : lb" - /(")(^',«")ir < ^^}, (36) 

where r will be determined later. Partitioning the set D{m') fl B{u"') into D{m') fl BivT-) fl Sp{'m',u'^) 
and D{m') fl B{u^) fl Sp{'m\u^Y and using (ES]), we can write 

/" P(l/'^,M"|m')rfy"> -e-"^^«- /" P{y'',u''\m')dy''. (37) 

The second term in the right hand side of (ITTl) can be bounded as follows 

/ P(y",w"|m')rf2/" 

-'D(m')nB(u")nSp{m',u")= 



< / P(|/",M"|m')rf2/" 



' Sp{m',u"Y 
n 

i=l 

<exp(-nP,(r)), (38) 
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where we have used the Chernoff bound in the last step. In that inequality Ec{t) is defined as 

Ec{t) = maxsr — fi{s), (39) 

s>0 

where fi{s) is the semi-invariant moment-generating function of the Chi-square distribution corresponding 

to K={y,-fP{m',u^-')f: 

/i(.) = logi?.[e^1 = I log(Y^). (40) 

Replacing n{s) in (l39l) and optimizing that equation we obtain 

E,(r) = ^(r-l-log(r)) (41) 

which is positive and increasing for all r > 1 and tends to infinity as r — )■ oo. Choose r such that 

E,(r) > R^^ + e, (42) 

for some e > 0, to be determined later. Using fl37|) and fl38l) we can write 

J D(m')nB{u")nSp{m' ,u") 
A 9 

2 (5 

>_g-n/?fs^ (44) 

where we guarantee the validity of the last step by the appropriate choice of uq. Now let's derive the 
lower bound for the integral in (!33|) as follows 

(27r)-"/2exp (-||y" - /(")(m',M")||2) rfy" (45) 

i:){m')nB{u") 

> / (27r)-"/2exp (-||y" - /(")(m',w")||2) rfy" (46) 

JD{m')nB{u")nSp(m',u") 

> e-»''^ / (2.)-»/^exp /Jly" -/'"'("-'■ -"")ll') ,,„ (47) 

J D{m')nB{u")nSp{m' ,u^) \ ^ / 

= e-""/2 /" P(l/",M"|m')rfy" (48) 

~'D(m')nB(u")nSp(m',u") 
> ^g-'^(^/2+i?Fi3)_ (49) 



The inequality (H9l) along with ( 133|) lead to 



II/^"^K^")IP > 1 (^ _ Ml) _ r _ ^^^^_ ^^^^ 

n ~ 4 n n 2 

Substituting s(n) = n{Eup{RFB) + 7) in the above inequality, we obtain 

Mi!Wf!M>P + i(^_L^:£_M|)). (51) 

n A 2 n 

By choosing e in (|42]) small enough such that ^-g^ + °^^* < 7/2, we conclude that for any feedback 
sequence m" which is dropped in any iteration of our algorithm: 

\\f(-)(m,un\\'>n{P + l). (52) 
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The above inequality is sufficient for us to prove the theorem. Noting that the cardinality of the set J at 
the end of our algorithm is e"^^^ , we can write 



m,U'-^)]] (53) 



n 

j=i 



m£M ' ' u"&U 



meM\J u"€Fo{m) 



Ml ^ 



>TmT ^ ^ PK|mMP+^) (56) 



J2 Pr{FoMI^}- (57) 



' ' meM\J 
meM\J 

>n(P + — )(l-e-"(^-^^^)) (59) 

16 

> nP (60) 

In the above derivation, (|56|) is obtained using (152)) and the fact that for all m E A^\J, all the m"'s in 
Po(^) are removed at the end of the algorithm. Also, (!58|) is a consequence of (p5|) and (1591) is satisfied by 
choosing 5 < jqpTq-- The last inequality is secured by the appropriate choice of uq. The above inequality 
shows the conflict of the power constraint and the assumption that fl24|) can hold for some n > no, where 
no is chosen such that for any n > uq 

6 2 

je— <^, (62) 

2 

^-n{R-RpB) ^ 7 ^ (^3) 

16P + 7 

Given the assumption of Rpg < R, it is clear that there exists no such that all the above three inequalities 
hold and this completes the proof. ■ 

Proof of Theorem^- We prove the achievability of the above error exponent using an iterative scheme 
similar to the one used in the proof of Theorem [TJ We use the exact same structure and notation as in 
the previous iterative scheme and just express the distinctions of this scheme. The main distinction is that 
here, instead of feeding back the decoded message (i.e. f/„j = mi), the receiver sends back a function of 
its decoded message 

Un,=g^''\m,), (64) 

where g^"^^ : A^ i— ;■ {1, ...,e"^^^} is the feedback decision function. After receiving 17^, the transmitter 
compares the received feedback with the feedback corresponding to the original message and stays silent 
if 

Otherwise, it sends the failure alarm and retransmits the message with high power exactly similar to what 
was described in the proof of Theorem [TJ Considering the range of the feedback function g^"'\.), it is 
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clear that this scheme meets the feedback constraint. Also it is easy to show that the power constraint is 
also met. In particular, note that the probability of retransmission in our scenario is 

which is less than or equal to 7 = Pr{m 7^ mi} and therefore the expected power used here is less than 
the case considered in Theorem [T] Also note that the types of errors seen here include the three types of 
errors in the earlier case (false negative, false positive and wrong decoding at the receiver) plus the error 
due to the fact that a subset of the decoding errors in the first block are not recognized by the transmitter. 
That is, the error corresponding to the event 

{m ^ mi,^(")(m) = (/("^(mi)}, 

which we call an error mis-detection event, must also be considered as a possible error event. We showed 
earlier that the algorithm in Theorem [T] achieves a doubly exponential error decay, where the error is 
associated with the first three types of errors. Therefore, the probability of error for the current scenario 
can be upper bounded by the sum of two terms: the probability associated with an error mis-detection 
event and the probability associated with the other three types of errors: 

Pe{n, R, R^^, P) < Pr{m ^ mi, g^^\m) = ^("H^i)} + exp(-exp(n(C + o(l)))), (65) 

for some C > 0. Given that the feedback rate is less than the feedforward rate, we expect the error mis- 
detection event to dominate the total error probability. Hence, the proof will be complete if we show that 
there exists a sequence of feedback encoding functions {fi'^"''(-)}^i such that 

Pr{m ^ mi,^(")(m) = ^("^^i)} < exp( - n{E^^^^{R) + R^^^^ + o(l))). (66) 

We show the existence of such a feedback encoder sequence using a random coding argument. Given n 
and a feedback function g^'^^ : A^ h^ {1, ..., e"^^^} , let's define the set V(")(j) for each j G {1, ..., e"^^^} 
as 

V(")(j) = {meM: g^^Xm) = j}. 

We can observe that, in fact, determining the function g^"'\.) is equivalent to partitioning {1, ..., e"^} into 
the sets {V^^\j)}'j'Lf^ . Now let's consider all the possible feedback functions for which 

for all J G {1, ..., e"^^^}. That is, let's consider all the equal partitionings of the set {1, ..., e"^^^}. From 
this set of functions, let's pick the function g*{.) uniformly randomly and use it as the feedback encoder 
function. We denote the partitioning associated with g*{.) by {V*(j)}jl/^. Now let's compute 

^[Pr{m ^ mi,^(")(m) = ^("^(mi)}], 
where the expectation is with respect to the randomness in picking the feedback function. We have 

_E[Pr{m 7^ mi,5f*(m) = 5f*(mi)}] = 

^[Em=i Pi'l"^ is sent} E»eAi,i^mPr{mi = i\m is sent}l|g.(i)=g.(^)}] = 
E^=i Pr{"^ is sent} Y.i&M,i^m P^l^i = i\m is sent}E[l|g*(i)=g,(^)}] . (67) 

For each pair {i,m), we can write 

E['i-{a*ii)=g'(m)}] = Pr{g*{i) = g*{m)} 

= Y^ Pr{^*(i) = k\g*{m) = fc}Pr{^*(m) = k} 

k=l 

gURpg 

= ^ Pr{iG V*(A;)|mG V*(A;)}Pr{mG V*(A;)}. (68) 



fc=i 
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Since {V*{j)}fL/'^ is uniformly randomly chosen from all equal partitionings of {1, ..., e*^^}, we can 
write for 2 7^ m and for any k E {1, ..., e"^^^} 



Fi{iE V*(A;)|me V*(A;)} 



Substituting the above equality in (168]) we get 



|V*(A;)|-1 



E^LTlvWl-i 

^n{R-RpB) _ Y 



quR _ I 



^n{R-RpB) _ I 
E['i-{g*{i)=g*{m)}] = ^nR _ l ' ^^^^ 



We can now combine ((69]) and ((67)) and conclude 



<,Tl-R 



^n{R-RFB) - 1 ,-^ ^ 

E[Pi{m 7^ mi,g*(m) = g*(7fii)}] = — >, P^l^^ is sent} 2, Pr{mi = i\m is sent} 

m=l iGM,i^m 

= g""(^Fs+o(i))pj^|£)gpQjj^g gj^j^Qj^ ^^ gj,g^ block} 

The above inequality implies that the expected (with respect to encoder selection) probability of error 
mis-detection event is less than the right hand side of ((661) . Therefore, we can conclude that there exists 
at least one feedback encoding function among the ones from which we randomly selected that satisfies 
([66]) . This completes the proof. ■ 

Proof of Theorem\5\- Here, we only present the proof for the case where L = 2. Following a similar 
approach as in Theorem [21 the proof can be extended to L > 2. 

For each R < C, there exists 5' > such that R < C(l — 6'). Let's fix 6' and consider the integer k 
which satisfies 

-<l<5'. (70) 

2 - k 

We divide the whole transmission block into k sub-blocks each with length / = n/k. We then partition each 
sub-block into three parts of lengths /i, 1 and I2 exactly the same as the partitioning in the 3— phase scheme 
proposed in Section III. In the first portion of sub-block j E {I, ...,k — 1}, message rrij which contains 
nR/{k — 1) nats of new information is transmitted on the forward channel using a non-feedback Gaussian 
codebook similar to the first phase of the algorithm described in Section III. After the transmission, this 
message is decoded and the decoded message rhj is transmitted back on the feedback channel during the 
first portion of the j + 1** sub-block and with the rate R nats per channel use. By the end of the feedback 
transmission (end of the first portion of sub-block j + 1), the transmitter can detect the decoding error. If 
rhj 7^ rrij, the failure alarm is sent in the second portion of the j + 1** sub-block and the message rrij is 
retransmitted with high power in the third portion of the j + P* block. In fact, for each sub-block we apply 
the 3-phase iterative scheme of Section III with the distinction that the error detection and retransmission 
for each sub-block occurs one sub-block after the original transmission. The forward rate per channel use 
in each sub-block is 

kR 1 

<Cil-5')il-y)<Cil-6'y. 



k-l ' '' k' 

Defining 5 = 25', the rate per channel use will be less than C(l — 5). Using the results of Section III, 
we can conclude that there exists C > such that the error probability P^ for the message rrij is upper 
bounded by 
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n 5'C 

Pi < exp(-exp(-C)) < exp(-exp(n— )), 

where the last inequality is a consequence of (|70|) . Using the union bound, the total error probability will 
be bounded as follows 

fc-i 

Pe < Y.P' 

i=i 

< {k- l)exp(-exp(n— )) 

< ^exp(-exp(n— )), 

where the last inequality is again a consequence of (ITOl) . Taking 6 = ^, the above inequality completes 
the proof. ■ 
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